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Abstract 

Recent studies have revealed a number of striking dependence patterns 
in high frequency stock price dynamics characterizing probabilistic interrela- 
tion between two consequent price increments x (push) and y (response) as 
described by the bivariate probability distribution V(x, y) [H [21 EH E] • There 
are two properties, the market mill asymmetries of V(x, y) and predictabil- 
ity due to nonzero z-shaped mean conditional response, that are of special 
importance. Main goal of the present paper is to put together a model re- 
producing both the z-shaped mean conditional response and the market mill 
asymmetry of V(x, y) with respect to the axis y = 0. We develop a proba- 
bilistic model based on a multi-component ansatz for conditional distribution 
V(y\ x) with push-dependent weights and means describing both properties. 
A relationship between the market mill asymmetry and predictability is dis- 
cussed. A possible connection of the model to agent-based description of 
market dynamics is outlined. 
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1 Introduction 



This paper is based on recent results on high frequency conditional dynam- 
ics in the stock market [H [2J [3j 0]. In [2J [31 H] we have described several 
newly discovered dependence structures characterizing high frequency stock 
dynamics - the market mill patterns corresponding to various asymmetries 
characterizing the bivariate probability distribution V(x, y) of two consecu- 
tive price increments x (push) and y (response). We have also discussed a 
number of effects that are best described in terms of moments of the condi- 
tional distribution V(y\x) of response y at given push x. All the conditional 
moments studied (mean, standard deviation, skew, normalized hypercum- 
ulant) reveal pronounced dependence on the push x. A nonzero z-shaped 
mean conditional response implying probabilistic predictability of a price in- 
crement from knowing the previous one is of special interest. In this paper we 
will focus on building a probabilistic model for both the nonlinear z-shaped 
mean conditional response and the corresponding market mill asymmetry 
pattern of V(x, y) with respect to the axis y = 0. 

The issue of price predictability is perhaps the most important one in 
theoretical finance. In the ideally efficient market it is impossible to predict 
future price increment using historical data, e.g values of the preceding incre- 
ments. If one describes price dynamics as a stochastic process, it should have 
zero conditional mean (y) x = 0, i.e. be a martingale, see e.g. [SJElEj- Let us 
stress that an existence of nonlinear dependence patterns related to higher 
moments of the conditional distribution does not contradict the martingale 
property and thus the weak-form market efficiency. Such dependencies were 
intensively studied, especially in the framework of conditional regressive dy- 
namics [3 [S] • 

An appearance of the nonlinear mean conditional response shows that 
within a standard paradigm of regression models formulated as a noisy map- 
ping of the push x into the response y, i.e. y = f(x) + e, the mapping 
f(x) = (y) x is nonlinear. Thus we are dealing with a nonlinear dynamical 
system, see e.g. [HI HOI- I n financial applications a particular class of such 
models, the threshold autoregression models [UJ were used in describing the 
properties of interest rates, see e.g. [12]. 

At a fundamental level it is necessary to describe the nonzero nonlinear 
mean conditional response in terms of market inefficiencj{§ leading to prob- 

3 Herc we refer to an idealized definition of market inefficiency in which the effects like 
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abilistic predictability. Some examples of such predictability were discussed 
in [5]. Recently the issue of probabilistic predictability was discussed in the 
context of agent-based modeling of financial market dynamics |13j . 

At a phenomenological level we need to build a probabilistic model ex- 
plaining an origin of both the nonlinear predictability of the mean conditional 
response and the corresponding market mill pattern within a simple and in- 
tuitive probabilistic model. This is the main objective of the present paper. 

When constructing a probabilistic model describing both the nonlinear 
conditional response and the market mill phenomenon we shall employ a step- 
by-step approach. First, we provide an analytical description of the mean 
conditional response (y) x . Second, we consider a simple dynamical system's 
version of noisy conditional dynamics characterized by the observed nonlinear 
mean conditional response dressed by an additive noise. We show that this 
picture does not allow to reproduce the market mill asymmetry associated 
with conditional response. Finally, we present a version of noisy conditional 
dynamics characterized by the push-dependent mixture of conditional distri- 
butions allowing to reproduce both the nonlinear mean conditional response 
and the corresponding market mill asymmetry and suggest its interpretation 
in the framework of agent-based modeling of financial markets. 

The outline of the paper is as follows. 

In Section 2 we start with describing in paragraph 2.1 basic quantitative 
characteristics of the asymmetry of the conditional response and an algorithm 
allowing to reconstruct a full bivariate distribution V(x, y) from a given con- 
ditional distribution V(y\ x). In paragraph 2.2 we discuss a single-component 
conditional distribution corresponding to conventional noisy conditional dy- 
namics that, by construction, reproduces an observed nonlinear dependence 
of the mean conditional response. We show that this model gives rise to an 
asymmetry pattern very different from the market mill one. In paragraph 2.3 
a multicomponent conditional distribution of response at given push V(y \ x) 
is proposed. Its simple version with constant weights considered in paragraph 
2.3.1 is shown to reproduce the market mill pattern but not the nonlinear 
mean conditional response. Two versions with push-dependent weights de- 
scribed in paragraph 2.3.2 are shown to produce both the market-mill shaped 
asymmetry of the asymmetric component of V(x,y) and the nonlinearity of 
the conditional mean response (y) x - In paragraph 2.4 we summarize the 

bid-ask spread and trading costs limiting potential profit based on specific predictability 
under discussion are discarded. 
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model-dependent relationships between the nonlinear predictability and the 
market mill pattern. We proceed with comments on possible relation of the 
proposed probabilistic model to agent-based modeling of financial dynamics 
in paragraph 2.5 and conclude the section with a discussion of the limitations 
of the proposed probabilistic model in paragraph 2.6. 

In Section 3 we summarize the results of the present paper. 

2 Modeling the conditional response 
2.1 General considerations 

At the basic level a probabilistic interrelation of the push x and the response 
y is quantified by the bivariate distribution V(x, y). As already mentioned in 
the Introduction, in the preceding papers [2j [3j H] we have described a number 
of interesting features of V(x,y). However, from the modeling standpoint 
it turns out convenient to start with specifying a conditional distribution 
V(y\ x). Note that knowledge of V(y\ x) and the marginal distribution V(x) 
fully specifies V(x,y) = V(y\x)V(x). In the considered case of equal time 
intervals the marginal distributions V(x) and V(y) are the same. Thus for 
given conditional distribution V(y\ x) the marginal distribution V(x) should 
satisfy the following consistency equation 



The adopted procedure of constructing the bivariate distribution V(x,y) 
starting from a given V(y \ x) goes in two steps: 

• Constructing the conditional distribution V(y \ x). 

• Specifying a reasonable marginal distribution V(x) satisfying the con- 
sistency condition (Tj[|). 

In the present paper we shall focus on building a model reproducing two 
phenomena described in [21 El 0] : 

• Market mill asymmetry of V(x,y) with respect to the axis y — . 

• Predictability of response at given push due to the nonlinear z - shaped 
mean conditional response (y) x . 




(1) 
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These effects are illustrated in Figs. 1,2 correspondingly. 

Both above-described features originate from the nontrivial asymmetric 
component V a (x,y) of the distribution V(x,y): 

V a = ±(V{x,y)-V{x,-y)) (2) 

The market mill dependence pattern refers to the specific shape of the pos- 
itive component V a( - P \x,y) = V a (x,y) ■ [V a (x,y)] of V a (x,y), where 9 is 
a Heaviside step function. In turn, an emergence of the nonlinear ^-shaped 
mean conditional response (y) x is quantified by the corresponding proper- 
ties of the asymmetric component V a (y\x) of the conditional distribution 
V(y\x)=V(x,y)/V(x): 

(y) x = [ dyyV a (y\x) (3) 



2.2 Single-component conditional distribution 

Let us first consider the following single- component conditional distribution: 

V{y\x)=V{y-m{x)), (4) 

where m(x) = {y) x is the mean conditional response having characteristic z 
- shaped form [2]. To be concrete let us parametrize the push dependence 
of the mean conditional response in the case where x and y correspond to 
consecutive price increments in 3-minutes time intervals as follows: 

m{x) = (y) x = [0.14 - 0.24 | x\ °' 15 ] • x (5) 

In Fig. 3 we plot the analytical approximation ([5]) together with correspond- 
ing market data from Fig. 1. 

To completely specify the conditional dynamics (jl]) we have to choose a 
functional form of the distribution V(y\ x) = V(e), where e = y — m(x). 
Our choice is a Laplace distribution V L {e) = (0.5/cr) exp(— | e\/a) with a = 
$ 0.05fl This selection is motivated by the tent-like shape of the loga- 
rithm of the the central part of probability distribution of high-frequency 

4 This choice of a corresponds to a mean absolute price increment at 3-minutes scale 
(| 5p |) = $ 0.052 for the ensemble of stocks and time period used in [2]. 
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price increments, see e.g. [3j. The conditional distribution is thus given by 
V(y\ x) = V L (y — m(x)). 

After having specified the conditional distribution V(y\ x) and the marginal 
distribution V(x) we have to check that this choice is consistent, so that these 
distributions satisfy Eq. ([I]). We have numerically checked that Eq. ([1]) is 
indeed satisfied by V(y — m(x)) and V L (x) with above- described parameters. 
Thus the full bivariate probability distribution V(x, y) is completely specified. 
Its antisymmetric component is plotted in Fig. 4 (a). We conclude that re- 
constructing the conditional dynamics as described by the single-component 
distribution (@J with appropriate push - dependent mean ([5]) leads to the 
two-dimensional asymmetry pattern very different from the market mill one 
in Fig. 2. Thus here we have a model with proper nonlinear mean conditional 
response but with wrong asymmetry pattern. 

Let us note that the probabilistic model (T4"f5]) corresponds to a dynamical 
system with nonlinear mapping x m(x) dressed by additive noise e with 
distribution V{e) and zero mean (e) = 0: 

y = m(x) + e (6) 

We see that a description in terms of a dynamical system ([6]) explains (by 
construction) the predictability related to the nonlinear ^-shaped dependence 
of the conditional mean response on the push but leads to the push-response 
asymmetry structure very different from the observed market mill shape. 

2.3 Multi-component conditional distribution 

Let us now construct a conditional distribution V(y \ x) ensuring the ^-shaped 
mean conditional response in a different fashion. From the analysis of [21 [3], 0] 
we know that the asymmetry in question is relatively weak, so that a domi- 
nant component of V(y \ x) should be symmetric with respect to the reflection 
y — > —y and, correspondingly, its dominant peak should be at y = 0. In ad- 
dition to the symmetric component V°(y\ x) having a constant weight w° 
the distribution should include two asymmetric components V ± {y \ x) having 
push- dependent weights w ± (x): 

V(y\x) = w + (x)P + (y\x) + w°V°(y\x) + w~ (x) V (y \ x) (7) 

where w + (x) + w° + w~(x) = 1. Below we shall take w° = 0.85. The dis- 
tributions V ± (y\ x) have peaks at yf such that sign(y^) = ±sign(x), so that 
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the component V + (y\ x) corresponds to a trend-following response, whereas 
the component V~(y \ x) corresponds to a contrarian one. Below we shall use 
a simple parametrization of m (x) ensuring these properties: 

(y)+ = m + (x) = (1 + q) x 
(y)° x = m°(x) = 

(y)x = m~(x) = -(l-q)x, (8) 

where ee J dyyV + (y \ x), and q is a parameter responsible for generat- 
ing the asymmetry in question, for which we will take a value of q = 0.1£@. 
In Fig. 5 we plot the full conditional distribution V(y \ x = 0.1) and its three 
components using the weights w° = 0.85, w + = 0.015 and w~ = 0.135. Let 
us note that a direct observation of multi-modal structure based on market 
data can be very difficult. As could be seen from Fig. 5, a presence of two 
asymmetric contributions leads to a very slight distortion of the full condi- 
tional distribution. At the same time their impact on asymmetry-sensitive 
quantities like the mean conditional response or the market mill asymmetry 
can be very pronounced. 

A simple calculation shows that the mean conditional response corre- 
sponding to Eq. fl7J) takes the following form: 

(y)x= [(w + (x) — w~(x)) + q (w + (x) + w~(x))~\ x (9) 
2.3.1 Constant weights w ± 

It is instructive to start with the case of constant weights w ± . From Eq. (Q 
we see that in this case one gets a purely linear correlation between the 
conditional mean response and the push. To analyze the two-dimensional 
asymmetry pattern one has to specify the functional form of the distributions 
V ±,0 (y \ x). Let us assume that all three are Laplace distributions 

V^\y\m^\x),a) = ±- exp {-\y - m±'°(x) (10) 

with common width parameter a. We choose = 0.075, q = 0.15 and 
a = $ 0.052. Using the consistency condition Eq. (TjQ) we have checked that 

The values of parameters contained in the model distribution V{y\ x) in the expressions 
for weights w and parameters of the component distributions are fixed by fitting the 
resulting mean conditional response to the market data corresponding to the time scale of 
3 minutes. 
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the distribution (jTJ) with constant weights w ± indeed solves ([I]) so that we can 
reconstruct the full two-dimensional distribution V(x,y) and its asymmetric 
component V a . The resulting asymmetry pattern is shown in Fig. 4 (b). 
We see that it has a characteristic market mill shape. We conclude that a 
model based on the distribution ([7]) with constant weights w ± does reproduce 
the market mill asymmetry pattern but does not reproduce the nonlinear z- 
shaped mean conditional response. 

2.3.2 Push-dependent weights w ± (x) 

The experimentally observed dependence does have a pronounced nonlinear 
z-shaped form, see Fig. 1 and [2], so we have to correct our model in order 
to reproduce it. In fact, to generate such nonlinear dependence it is suffi- 
cient to consider of push-dependent weights w ± (x). To reproduce the 
z-shaped pattern of Fig. 1 we have to ensure a bias towards trend-following 
behavior, (i.e positive slope of (y) x vs the push) at small nonzero x and a bias 
towards contrarian behavior (negative slope of (y) x vs the push) at large x. 
The components of the conditional distribution Eq. ([7]) responsible for trend- 
following and contrarian behavior are obviously V + (y \ x) and V~(y \ x) cor- 
respondingly, so we have to choose some appropriate parametrization of the 
weights w ± . A simple illustration leading to the conditional mean response 
shape qualitatively similar to that in Fig. 1 is provided by 

1 — w° 

w^x) = — ^— {lT\x\) => (y) x = (l-w°)x(q-\x\) (11) 

From Eq. fTTTT) we see that the role of the parameter q is in fixing the scale 
at which the nonlinear mean conditional response (y) x changes its sign. 

To reproduce the shape of conditional mean response close to the observed 
one, see Fig. 1, we shall use a somewhat more complex parametrization of 
the weights w ± (x): 




w + (x) = l-w°-w~(x) (12) 

Choosing w° = 0.85, w a = 0.05, q = 0.25, p = 0.5 and o = $0,052 gives 
an alternative parametrization of conditional response, cf. Eq. (jSJ). The 
resulting asymmetry pattern is plotted in Fig. 4 (c) and the push dependence 
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of the weights w (x) is illustrated in Fig. 6. Let us note that such fine details 
of the structure of the asymmetry of the empirical distribution V(x, y) shown 
in Fig. 2 as the shape of equiprobability lines and the varying form of mill 
blades are, as seen in Fig. 4 (), reproduced by our model . Let us stress 
that a bias towards trend-following behavior at small pushes and towards 
contrarian behavior at large ones results from combination of x - dependent 
weights and means. 

Let us also emphasize that although it is formally possible to rewrite the 
conditional dynamics described by the three-modal distribution with push- 
dependent weights Eq. (|7j) in the form of Eq. (|4f6|) with some very intricate 
noise distribution V(e), this procedure looks extremely unnatural. In this 
sense the three-component conditional distribution Eq. (J7j) presents a really 
different view on conditional dynamics than the conventional Eq. ([6]). 

Thus a model based on the conditional distribution © with specially 
chosen push-dependent weights w ± (x) allows to reproduce both the nonlinear 
mean conditional response and the market mill asymmetry pattern. 

The above-discussed procedure of constructing the conditional distribu- 
tion V(y\x) is by no means unique. For example, ways of introducing the 
response asymmetry can be different. In particular, in the above-discussed 
parametrization Eqs. (l8][T2l the asymmetry in question was ascribed to an 
asymmetric structure of the means of its components V{y\x) ± . 

Let us now introduce another model of the same class which is also based 
on the three-component conditional distribution V(y\x). In this model the 
weights are the same as in Eq. ffl2l . but an asymmetric dependence of 
the characteristics of the component distributions is "shifted" from the means 
m ± (x) to the widths a ± (x): 

m + (x) = x, 
m°(x) = 0, 
m _ (x) = —x, 

The resulting asymmetry pattern is shown in Fig. 4 (d). All the results 
obtained using the model of Eqs. (11211131) are similar to those obtained with 
Eqs. (Ed. 



a + (x) = 0.005 + 0.065 
a = 0.03 

a -(x) = 0.01 + 0.025 



0.5 



0.3 



0.5 



0.3 



(13) 
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The models of Eqs. (JHH2D and (I12|I13I) are just two examples from a long 
list of models that describe the market mill asymmetry with respect to the 
axis y = and the nonlinear mean conditional response. Our choice was 
motivated by their transparent logical structure. 

2.4 Relationship between nonlinear predictability and 
market mill asymmetry 

At this point it is appropriate to summarize the relationship between the phe- 
nomena of nonlinear predictability due to nontrivial push-dependent mean 
conditional response and the market mill asymmetry pattern characterizing 
the asymmetry of the bviariate distribution V(x,y) with respect to the axis 
y — 0. Our considerations in the paragraphs 12.21 and 12.31 have shown that 
this relationship is model - dependent. In particular: 

• The single- component model (jUEJ) of conditional dynamics describes 
the nonlinear predictability (i.e. the ^-shaped mean conditional re- 
sponse) but gives a wrong asymmetry pattern of V(x,y). Thus the 
nonlinear mean conditional response does not constitute a sufficient 
condition for the existence of the market mill asymmetry. 

• The multi-component model (ITIIBl) with constant weights describes 
the market mill asymmetry pattern but not the nonlinear dependence 
of the mean conditional response on push. Thus the market mill asym- 
metry does not constitute a sufficient condition for the existence of the 
nonlinear mean conditional response. 

2.5 Market mill from the agent-based perspective 

Let us now discuss possible origins of the market mill asymmetry in the 
framework of agent-based description of financial market dynamics. 

A direct link between the agent's strategies and price evolution is provided 
by the relation between the sign of market orders and the resulting change 
in price, see e.g. [H]. In the discrete time formulation this corresponds to a 
dependence of price increment Sp t = Vt+i ~ Pt on cumulative sum of signed 
orders Q t = V t + — V t ~ placed at time t, where V ± is a volume of buy (+) and 
sell (-) orders. With the simplest assumption of linear impact 

5 Pt = \n t . (14) 
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Assuming constant proportionality coefficient A, the probability distribution 
of price increments is a rescaled version of the probability distribution of the 
signed volume: 

P(Qt) — > V(5 Pt ) (15) 

The distribution V(Qt) provides a probabilistic description of agent's 
strategies realized through buying (selling) a certain number of stocks or just 
doing nothing at time t. Let us consider a simple case when such trading 
decisions depend on the preceding price increment Spt-i, so that 

r(6 Pt ) = jV(n t \Sp t -i). (16) 

Within this framework it is natural to classify agents into three groups, trend- 
following, contrarian and noise, characterized by probability distributions 
V + (fi t | Spt-i), V~ (Q t \ 8pt-i) an d V° (fit | Spt-i) correspondingly The dis- 
tributions V ±,Q (f2f| dpt-i) are biased in such a way that 

sign ( {V ± (fit | 5pt_i)) ) = ±sign (6p t -i) 

and ( V (Q t \ 5pt-i) ) = 0. The trend-followers are betting that the sign 
of the next price increment is on average the same as that of the previous 
one, the contrarians bet on sign reversal and noise traders make random 
decisions. Generically the yields w ±,Q of the trend- following, contrarian and 
noise strategies depend both on the sign and magnitude of 5p t -i- Thus 

V{Sk\Sp t -x)= J2 w\ht-i)-V l {n t \5p t ^) (17) 

»=+,-, 

which is, due to (TH1) . precisely the conditional distribution ([7j). Therefore 
the three components of the conditional distribution V {VL t \ 5pt~i) in Eq. (|17[) 
(trend-following, contrarian and noise) correspond to the three components 
of the conditional distribution V(y\x) in Eq. (j7]). 

2.6 Limitations of the model 

In the present paper we have focused on building a probabilistic model de- 
scribing the market mill asymmetry pattern corresponding to one particular 
asymmetry of V(x,y), that of reflection y — ► —y. As shown in [2], the full 
empirical bivariate distribution is in fact characterized by several asymmetry 



11 



patterns having the market mill shape, e.g. that corresponding to the reflec- 
tion with respect to the axis y = x. We have checked that with the three- 
component conditional distribution Eq. (jTJ) one can not reproduce the market 
mill pattern corresponding to this last asymmetry. In fact, our model is tai- 
lored to describe only one particular asymmetry, that of conditional response. 
Constructing a probabilistic description of the full asymmetry structure of 
V(x, y) remains a task for the future. 

Let us also note that even with the asymmetry pattern under consider- 
ation the model conditional distribution ([7]) does not allow to reproduce all 
details of the empirically observed pattern. 

3 Conclusions 

Let us summarize the main results obtained in the paper: 

• A probabilistic model based on the multi-component model for con- 
ditional distribution V(y \ x) reproducing the nonlinear z-shaped mean 
conditional response and the market mill conditional response asym- 
metry pattern was constructed. 

• We demonstrated that a single-component model corresponding to con- 
ventional noisy conditional dynamics with built-in z-shaped mean con- 
ditional response does not allow to reproduce the market mill condi- 
tional response asymmetry pattern. Thus an existence of the z-shaped 
mean conditional response does not imply the market mill asymmetry 
pattern. 

• Consideration of the case of push-independent weights in the multi- 
component model conditional distribution V(y\ x) showed that the mar- 
ket mill asymmetry pattern can coexist with the usual linear depen- 
dence of the mean conditional response on push. Thus an existence of 
the market mill asymmetry pattern does not imply the z-shaped mean 
conditional response. 

• A possible link of the discussed probabilistic model with agent-based 
description of market dynamics was outlined. 
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Figure 1: Mean conditional response versus push (from [2]). 
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Asymmetry with respect to y=0 axis 
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Figure 2: Two-dimensional projection of the asymmetry of the bivariate 
distribution log 2 (V(x,y)) (from [2]). Borders between different colors show 
equiprobability lines. 
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Figure 3: Mean conditional response: model versus data. 
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Figure 4: Two-dimensional projection of the asymmetry of the bivariate dis- 
tribution P(x,y). (a) Single-component model, (b) Multi-component model, 
constant weights, (c) Multi-componeU model, push-dependent weights, (d) 
Multi-component model, push-dependent volatility. Borders between differ- 
ent colors show equiprobability lines. 



9.00 n 



1 




-0.30 -0.25 -0.20 -0.15 -0.10 -0.05 0.00 0.05 0.10 0.15 0.20 0.25 0.30 

x (push), $ 



;ure 5: Components of conditional distribution V(y \ x) at x — 
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